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Response to Amendment 

1 . In response to the Office Action mailed 08 August 2008, applicant submitted a response 
filed 08 December 2008, in which the applicant amended claims 1, 25, 49, 73, 97, 101, 105 and 
109 without adding new matter. 

Response to Arguments 

2. Applicant's arguments filed 08 December 2008 have been fully considered but they are 
not persuasive. Applicants' arguments on pages 33-36 of Remarks fail to comply with 37 
CFR 1.11 1(b) because they amount to a general allegation that the claims define a patentable 
invention without specifically pointing out how the language of the claims patentably 
distinguishes them from the references. Applicant is merely pointing out claimed features and 
unrelated features of the applied references. 

Additionally, applicant admits on page 33 that BANBROOK discloses singular value 
decomposition (SVD), which 'is an important factorization of a rectangular. . .matrix.' 
Therefore, the rejections stand as restated below. 

With respect to applicants' arguments of claim 97 on pages 34-35 of Remarks, the 
examiner respectfully disagrees. COORMAN clearly discloses extracting samples from a 
database of time-domain waveforms and applying said sample data to construct feature data used 
in creating a matrix. 
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Claim Rejections - 35 USC § 103 

3. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

(It is noted that the limitations below are grouped together for sake of length and that they are 
identical but for the statutory categories they represent.) 

4. Claims 1, 3-8, 19-20, 25, 27-32, 43-44, 49, 51-56, 67-68, 73, 75-80, 91-92, 97, 99, 101, 
103, 105, 107, 109 and 1 1 1 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Coorman et al (US Patent No. 6,665,641) referred to as COORMAN in view of Michael 
BANBROOK, 'Nonlinear Analysis of Speech From a Synthesis Perspective', A thesis 
submitted for the degree of Doctor of Philosophy at The University of Edinburgh; October 
15, 1996 (Specifically Chapter 4), referred to as BANBROOK hereinafter. 

Claims 1. 25. 49 and 73 : COORMAN discloses a method (as disclosed in COORMAN 
see claims 51-93), computer-readable medium having instructions to cause a machine to perform 
a machine-implemented method (col. 19, lines 20-30), apparatus (i.e., speech synthesizer, see 
Claims 1-50) and a system (see col. 19, 'Conclusion') comprising a processing unit (e.g., 'Text 
Processor') coupled to a memory through a bus (see col. 19, 'Conclusion', which describes the 
flexibility of the system to be implemented on any number of computer systems using a plurality 
of either fixed of removable storage means (e.g., memory).) for analyzing speech for use in 
synthesis, comprising: 
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i. extracting portions from speech segments (col. 5, lines 28-30), the portions surrounding 
a segment boundary within a phoneme (col. 19, lines 1-9); 

ii. identifying time samples form the portions (digitally sampled waveforms, col. 20, lines 
19-20, which are contained within databases.); 

iii. determining a distance between the feature vectors in the vector space (col. 18, lines 

16-19). 

However, COORMAN fails to, but BANBROOK does specifically disclose wherein 
features include phase information of the portions (p. 37, 'The data is projected onto a phase 
space defined by the singular vectors of the data, which can then be partitioned into a signal 
subspace and a noise subspace. ') and wherein deriving feature vectors representing time sample 
portions from a feature space comprises constructing a matrix W from the portions (col. 18, lines 
21-23, 'The calculation of this spectral mismatch is based on a distance calculation between 
spectral vectors . This might be a heavy task as there can be many segment combinations 
possible. In order to reduce the computational complexity a combination matrix— containing the 
spectral distances- could be calculated in advance . ' [emphasis added]). 

However, COORMAN recites performing operations on said matrix containing the 
portions surrounding the segment boundary within the phoneme (i.e., diphone), however failing 
to, but BANBROOK does specifically disclose decomposing the matrix W (p. 37, 'The method 
of singular value decomposition (SVD) reduction, described by Broomhead and King [85, 103], 
addresses this problem. r ). 

Therefore, it would have been obvious to one having ordinary skill in the art at the time 
of invention to include the teachings of BANBROOK in the system of COORMAN because it 
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introduces a combination of analysis tools (e.g. time delay embedding, singular value 
decomposition, correlation dimension, local singular value analysis, Lyapunov spectra and short 
term prediction properties) and looks in detail at Lyapunov exponents and two major novel 
modifications are proposed that are demonstrated to be more robust than conventional techniques 
(Abstract). 

Claims 3. 27. 51 and 75 : COORMAN discloses a system as per claims 1, 25, 49 and 73 
above, further comprising extracting global boundary-centric features from the portions (col. 10, 
lines 49-54). 

Claims 4, 28, 52 and 76 : COORMAN discloses a system as per claims 1, 25, 49 and 73 
above, wherein the speech segments each include the segment boundary within the phoneme 
(col. 9, lines 5-8). 

Claims 5. 29. 53 and 77 : COORMAN discloses a system as per claims 4, 28, 52 and 76 
above, wherein the speech segments each include at least one diphone (col. 9, lines 5-8). 

Claims 6, 30, 54 and 78 : COORMAN discloses a system as per claims 5, 29, 53 and 77 
above, wherein the portions include at least one pitch period (col. 19, lines 7-9). 

Claims 7. 31. 55 and 79 : COORMAN, in view of BANBROOK disclose a system as per 
claims 6, 30, 54 and 78 above. However, COORMAN fails to, but BANBROOK does 
specifically disclose wherein decomposing the matrix W comprises performing a pitch 
synchronous (p. 37, 'which can then be partitioned into a signal subspace and a noise 
subspace. ') singular value analysis on the pitch periods of the time-domain segments (p. 37, 'The 
method of singular value decomposition (SVD) reduction, described by Broomhead and King 
[85, 103], addresses this problem. '). 
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Therefore, it would have been obvious to one having ordinary skill in the art at the time 
of invention to include the teachings of BANBROOK in the system of COORMAN because of 
the reasons described above. 

Claims 8. 32. 56 and 80 : COORMAN, in view of BANBROOK disclose a system as per 
claims 6, 30, 54 and 78 above. However, COORMAN fails to, but BANBROOK does 
specifically disclose wherein the matrix Wis a 2KM x N matrix represented by 

W= U£V T 

where K is the number of pitch periods near the segment boundary extracted from each segment, 
iVis the maximum number of samples among the pitch periods, M is the number of segments in a 
voice table having a segment boundary within the phoneme, U is the 2KMx R (p. 37, N x w 
trajectory matrix found utilizing time delay embedding) left singular matrix with row vectors ui 
(1 <i< 2KM), E is the R x R diagonal matrix of singular values si > S2 > . . . > sr > 0, Vis the Nx 
R right singular matrix with row vectors Vj (7</ <N), R « 2KM, and T denotes matrix 
transposition, wherein decomposing the matrix ^comprises performing a singular value 
decomposition of W(p. 37-38, X= SZC T , where X is the trajectory matrix, S and C are the 
matrices of the singular vectors associated with Z, which is a diagonal matrix. ). 

Therefore, it would have been obvious to one having ordinary skill in the art at the time 
of invention to include the teachings of BANBROOK in the system of COORMAN because of 
the reasons described above. 

Claims 19, 43, 67 and 91 : COORMAN discloses a system as per claims 5, 29, 53 and 77 
above, wherein the portions include centered pitch periods (col. 19, lines 7-9, In the preferred 
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embodiment the length of the trailing and leading regions are of the order of one to two pitch 
periods and the sliding window is bell-shaped [i.e. centered]. '). 

Claims 17-18. 41-42. 65-66 and 89-90 : COORMAN discloses a system as per claims 1, 
25, 49 and 73 above, wherein said distances are associated with said speech segments (units, col. 
11, section 'Cost Functions', lines 46-49, 'a set of nonlinear cost functions has been defined for 
use in the unit selection... with specific properties which help in the unit selection process. r ). 

Claims 20. 44. 68 and 92 : Claims 20, 44, 68 and 92 are similar in scope and content to 
that of claim 8 above and so therefore are rejected under the same rationale. 

Claims 97. 101. 105 and 109 : COORMAN discloses a method (as disclosed in 
COORMAN see claims 51-93), computer-readable medium having instructions to cause a 
machine to perform a machine-implemented method (col. 19, lines 20-30), an apparatus (i.e., 
speech synthesizer, see Claims 1-50) and a system {see col. 19, 'Conclusion') comprising a 
processing unit (e.g., 'Text Processor') coupled to a memory through a bus {see col. 19, 
'Conclusion', which describes the flexibility of the system to be implemented on any number of 
computer systems using a plurality of either fixed of removable storage means (e.g., memory).) 
for analyzing speech for use in synthesis, comprising: 

i. gathering time-domain samples from recorded speech segments (col. 20, lines 17-22, 
'The database may directly contain digitally sampled waveforms , or it may include pointers to 
such waveforms, ' [emphasis supplied]) wherein the time samples include pitch periods 
surrounding a segment boundary within a phoneme (col. 19, lines 1-9); 

ii. deriving feature vectors that represent the time samples in a vector space (col. 4, lines 
23-25, 'The acoustic join cost is based on a quantization of the mel-cepstrum, '); 
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iii. determining a discontinuity between the segments (col. 12, 'Cost Functions for 
Numeric Features', 'Imprecise linguistic or acoustic knowledge, for example, how big a 
discontinuity in pitch can be perceived, '), the discontinuity based on a distance between the 
features {'For example, the mismatch of pitch between phones with the same accentuation (either 
both accented, or both unaccented) in the Transition Cost has a symmetric cost function... '). 

However, COORMAN fails to, but BANBROOK does specifically disclose wherein 
features include phase information of the portions (p. 37, 'The data is projected onto a phase 
space defined by the singular vectors of the data, which can then be partitioned into a signal 
subspace and a noise subspace. ') and wherein creating feature vectors comprises constructing a 
matrix W from the portions (col. 18, lines 21-23, 'The calculation of this spectral mismatch is 
based on a distance calculation between spectral vectors . This might be a heavy task as there 
can be many segment combinations possible. In order to reduce the computational complexity a 
combination matrix-containing the spectral distances- could be calculated in advance . ' 
[emphasis added]). 

However, COORMAN recites performing operations on said matrix containing the time 
domain samples of the pitch periods surrounding the segment boundary preserving phase 
information (p. 37, 'The data is projected onto a phase space defined by the singular vectors of 
the data, which can then be partitioned into a signal subspace and a noise subspace. '), however 
failing to, but BANBROOK does specifically disclose decomposing the matrix W (p. 37, 'The 
method of singular value decomposition (SVD) reduction, described by Broomhead and King 
[85, 103], addresses this problem. '). 
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Therefore, it would have been obvious to one having ordinary skill in the art at the time 
of invention to include the teachings of BANBROOK in the system of COORMAN because it 
introduces a combination of analysis tools (e.g. time delay embedding, singular value 
decomposition, correlation dimension, local singular value analysis, Lyapunov spectra and short 
term prediction properties) and looks in detail at Lyapunov exponents and two major novel 
modifications are proposed that are demonstrated to be more robust than conventional techniques 
(Abstract). 

Claims 99. 103. 107 and 1 1 1 : COORMAN discloses a system as per claims 98, 102, 
106, and 1 10 above. However, COORMAN fails to, but BANBROOK does specifically 
disclose wherein features include phase information of the portions (p. 37, 'The data is projected 
onto a phase space defined by the singular vectors of the data, which can then be partitioned into 
a signal subspace and a noise subspace. '). 

Therefore, it would have been obvious to one having ordinary skill in the art at the time 
of invention to include the teachings of BANBROOK in the system of COORMAN because of 
the reasons described above. 

5. Claims 9-10, 21-23, 33-34, 45-47, 57-58, 69-71, 81-82 and 93-95 are rejected under 35 
U.S.C. 103(a) as being unpatentable over COORMAN, in view of BANBROOK and in further 
view of ANSARI et al., 'Pitch Modification of Speech Using a Low-Sensitivity Inverse Filter 
Approach'; IEEE Signal Processing Letters; March 1998 referred to as ANSARI hereinafter. 

Claims 9. 33. 57 and 81 : COORMAN, in view of BANBROOK disclose a system as per 
claims 8, 32, 56 and 80 above, however failing to, but ANSARI does specifically disclose 
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padding a signal with zeroes (p. 61, section III, 'in the new method when the residual is modified 
with zero-padding to lower the pitch. '). 

Therefore, it would have been obvious to one having ordinary skill in the art at the time 
of invention to include the teachings of ANSARI in the system of COORMAN, in view of 
BANBROOK because speech modifications using the method of ANSARI are superior in 
quality to those obtained with RELP, while at the same time being less sensitive than RELP to 
errors in pitch marking (Abstract). 

Claims 10. 34. 58 and 82 : COORMAN, in view of BANBROOK disclose a system as 
per claims 9, 33, 57 and 81 above. However, COORMAN fails to, but BANBROOK does 
specifically disclose wherein a feature vector u t is calculated as 

where u, is a row vector associated with a pitch period i, and E is the singular diagonal matrix 
(p.49, 'In general, any matrix A can be written A = QR (4.17) where Q has orthogonal columns 
and R is a square upper-right triangular matrix with positive values on the diagonal. '). 

Therefore, it would have been obvious to one having ordinary skill in the art at the time 
of invention to include the teachings of BANBROOK in the system of COORMAN because of 
the reasons described above. 

Claims 21. 45. 69 and 93 : COORMAN, in view of BANBROOK disclose a system as 
per claims 20, 44, 68 and 92 above, however failing to, but ANSARI does specifically disclose 
symmetrically padding a signal with zeroes (p. 61, section III, 'in the new method when the 
residual is modified with zero-padding to lower the pitch. '). It would have been obvious to one 
having ordinary skill in the art that if pitch periods were centered, that one would be motivated to 
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append zeros symmetrically on either side of the centered samples in order to maintain 
symmetric proportions with respect to a centered pitch. 

Claims 22-23. 46-47. 70-71 and 94-95 : Claims 22-23, 46-47, 70-71 and 94-95 are similar 
in scope and content to that of claims 10 and 12 above and so therefore are rejected under the 
same rationale. 

6. Claims 11-15, 35-39, 59-63 and 83-86 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over COORMAN and BANBROOK in view of AN SARI and in further view of 
Jerome R. BELLEGARDA, 'Exploiting Latent Information in Statistical Language 
Modeling' referred to as BELLEGARDA hereinafter. 

Claims 11. 35. 59 and 83 : COORMAN and BANBROOK in view of ANSARI disclose 
a system as per claims 10, 34, 58 and 82 above. However, COORMAN and BANBROOK in 
view of ANSARI fail to, but BELLEGARDA does specifically disclose wherein the distance 
between two feature vectors is determined by a metric comprising the cosine of the angle 
between the two feature vectors (p. 5, 'We conclude that a natural metric to consider for the 
"closeness" between words is therefore the cosine of the angle between u t and Uj. r ). 

Therefore, it would have been obvious to one having ordinary skill in the art at the time 
of invention to include the teachings of BELLEGARDA in the system of COORMAN and 
BANBROOK in view of ANSARI because it uses latent semantic analysis to improve statistical 
language modeling utilizing existing clustering techniques capable of aiding in speech 
production by machine (Introduction). 
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Claims 12. 36. 60 and 84 : COORMAN and BANBROOK in view of ANSARI disclose 
a system as per claims 10, 34, 58 and 82 above. However, COORMAN and BANBROOK in 
view of ANSARI fail to, but BELLEGARDA does specifically disclose wherein the metric 
comprises a closeness measure, C, between two feature vectors, Uk and Ui, wherein C is 

calculated as ^ Mi 

for any 1 <k, I < 2KM (p. 6, (10)). 

Therefore, it would have been obvious to one having ordinary skill in the art at the time 
of invention to include the teachings of BELLEGARDA in the system of COORMAN and 
BANBROOK in view of ANSARI because of the reasons described above. 

Claims 13. 37. 61 and 85 : COORMAN and BANBROOK in view of ANSARI and in 
further view of BELLEGARDA disclose a system as per claims 12, 36, 60 and 84 above. The 



examiner is taking Official Notice that the difference as calculated by 

is simply a natural extension from the closeness measure as determined in the prior claim (which 
is assumed a value between 0 and 1). Therefore, it would have been obvious to one having 
ordinary skill in the art at the time of invention to include a difference measure as the counterpart 
to a closeness measure previously determined because it is well known that the two factors have 
an inversely variable relationship, here adding up to 1. 
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Claims 14, 38, 62 and 86 : COORMAN discloses a system as per claims 13, 37, 61 and 
85 above, wherein the calculation for the difference between two segments in the voice table, SI 
and S2, is expanded to include a plurality of pitch periods from each segment (col. 19, lines 1-9). 

Claims 15, 39, 63 and 87 : COORMAN discloses a system as per claims 13, 37, 61 and 
85 above, wherein the difference between two segments in the voice table, Si and S2, is 
associated with a discontinuity between Si and S 2 (col. 18, lines 48-54, 'The major concern of 
waveform concatenation is in avoiding waveform irregularities such as discontinuities and fast 
transients that may occur in the neighborhood of the join... It is thus important to minimize signal 
discontinuities at each junction. '). 

Allowable Subject Matter 

7. Claims 16, 40, 64 and 88 are objected to as being dependent upon a rejected base claim, 
but would be allowable if rewritten in independent form including all of the limitations of the 
base claim and any intervening claims. 

The above claims recite a specific distance formula as follows: 

The above is used as an alternative distance measure that is essentially the relative change 
in similarity that occurs during a concatenation function. More specifically, this alternative 
distance specifically shows wherein a difference is zero only when two identical segments are 
concatenated together; otherwise a difference measure greater than zero exists. While the cited 
prior art references do use distance measures as disclosed, none of the references use an 
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alternative distance measure as specifically disclosed and defined as per claims 16, 40, 64 and 
88. 

Conclusion 

8. THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of time 
policy as set forth in 37 CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within TWO 
MONTHS of the mailing date of this final action and the advisory action is not mailed until after 
the end of the THREE-MONTH shortened statutory period, then the shortened statutory period 
will expire on the date the advisory action is mailed, and any extension fee pursuant to 37 
CFR 1 .136(a) will be calculated from the mailing date of the advisory action. In no event, 
however, will the statutory period for reply expire later than SIX MONTHS from the mailing 
date of this final action. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to JUSTIN W. RIDER whose telephone number is (571)270-1068. 
The examiner can normally be reached on Monday - Friday 6:30AM - 3:30PM. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, David R. Hudspeth can be reached on (571) 272-7843. The fax phone number for 
the organization where this application or proceeding is assigned is 571-273-8300. 
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Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov 

Should you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a USPTO 
Customer Service Representative or access to the automated information system, call 800-786- 
9199 (IN USA OR CANADA) or 571-272-1000. 

/David R Hudspeth/ 

Supervisory Patent Examiner, Art Unit 2626 

/J. W. R./ 

Examiner, Art Unit 2626 
1 1 February 2009 



